COMPUTING THE ANALYSIS OF VARIANCE
OF FACTORIAL EXPERIMENTS ON AUTOMATIC COMPUTERS 


(A Preliminary Report) 


by 
J. H. Cameron. 
Statistical Engineering Laboratory 
National Bureau of Standards 

The purpose of this note is to describe the method of Yates¹
which seems well suited for computing the analysis of variance of
factorial experiments on large scale computers. The analysis for
the 2^n and 3^n series is given and the analysis for the general
factorial indicated. The detailed analyses of examples of the 2^n and
3^n series are given.


1. Analysis of variance of the 2^n factorial designs.


In a factorial design the effects of a number of factors are
investigated simultaneously. In the 2^n factorial design there are n
factors (such as temperature, etc.) each of which is studied
at two levels (e.g. 20° and 30° for temperature). Test conditions are
set up corresponding to each of the possible combinations of the two
levels of the n factors (i.e. 2^n combinations in all) and an
observation is recorded for each.

The 2^n combinations can be designated by a series of n subscripts
which are either 0 or 1 depending on whether the factor is at its low
or high level. For example for n = 3 as shown below each of the
8 possible combinations can be represented by X_abc, where a, b, and c
are either 0 or 1.

In the procedure given here it is necessary that the observations
be presented in a column of 2^n values in the order shown below for
the case n = 3. The extension to other values of n is obvious.


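     Designation of       Level of factor
     observation           A    B    C

        X000               0    0    0
        X100               1    0    0
        X010               0    1    0
        X110               1    1    0
        X001               0    0    1
        X101               1    0    1
        X011               0    1    1
        X111               1    1    1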

¹ F. Yates, "Design and Analysis of Factorial Experiments", Imperial
Bureau of Soil Science, Tech. Comm. No. 35, Harpenden,
1937.

Once this column of data is in the machine a column of "sums
and differences" also containing 2^n values is obtained from it by
tabulating the 2^(n-1) sums of successive pairs of observations followed
by the 2^(n-1) differences between the elements of the same pairs. Thus
for n = 3 we have 4 sums followed by 4 differences.
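One pass of this tabulation can be sketched in Python as follows. The function name and the eight observations are made up for illustration. The direction of the difference (first element of the pair minus the second) is an assumption, chosen because it agrees with the check on the column sum stated under "Computational checks"; the squares used in the analysis of variance do not depend on it.

```python
def sums_and_differences(column):
    """One pass: sums of successive pairs, then differences of the same pairs."""
    if len(column) % 2 != 0:
        raise ValueError("the column must hold an even number of values")
    pairs = [(column[i], column[i + 1]) for i in range(0, len(column), 2)]
    sums = [first + second for first, second in pairs]
    # Assumed sign convention: first element of the pair minus the second.
    differences = [first - second for first, second in pairs]
    return sums + differences


# Made-up observations for n = 3, in the order of presentation.
observations = [3, 5, 2, 8, 7, 1, 4, 6]
first_column = sums_and_differences(observations)
print(first_column)  # 4 sums followed by 4 differences
```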


The same process is repeated on the first column of "sums and
differences" to form a second column of "sums and differences", and
so on for each successive column so formed until the n-th column of
"sums and differences" is obtained.

The square of an entry in the n-th column of "sums and differences"
divided by 2^n corresponds to a single degree of freedom in the analysis
of variance. The single degrees of freedom for the several main effects
and interactions come out in the following sequence:

CF (correction factor for the mean), A, B, AB, C, AC, BC, ABC, D, AD, etc.


The entries in the n-th column of "sums and differences" divided
by 2^(n-1) give an estimate of the average difference between the levels
of a factor.


Computational checks.
    1. The sum of the entries in the n-th column of "sums and
       differences" is equal to 2^n X000...0 (i.e. 2^n times
       the leading element in the data as presented to the
       machine).

    2. The sum of the squares of the entries in the n-th
       column is equal to 2^n times the sum of squares of the
       elements of the original column of observations.
Analysis of Variance for 2^n Factorial Design: Example for n = 4


     Level of factor       Observed value* for
     A    B    C    D      this treatment combination

     0    0    0    0             60
     1    0    0    0             20
     0    1    0    0             83
     1    1    0    0             59
     0    0    1    0             19
     1    0    1    0             77
     0    1    1    0             13
     1    1    1    0             39
     0    0    0    1              5
     1    0    0    1             26
     0    1    0    1             27
     1    1    0    1             85
     0    0    1    1             25
     1    0    1    1             47
     0    1    1    1             86
     1    1    1    1             76

                           Sum   747

     * Values taken from table of random numbers.


     Entry in       Square of      Entry divided     Effect     Square divided
     4th column     entry          by 2^(n-1)                   by 2^n

        747         558,009           93.375**       Mean        34875.5625
       -111          12,321          -13.875         A             770.0625
       -189          35,721          -23.625         B            2232.5625
        -11             121           -1.375         AB              7.5625
        -17             289           -2.125         C              18.0625
         81           6,561           10.125         AC            410.0625
        -97           9,409          -12.125         BC            588.0625
        117          13,689           14.625         ABC           855.5625
         -7              49           -0.875         D               3.0625
         71           5,041            8.875         AD            315.0625
        153          23,409           19.125         BD           1463.0625
        -21             441           -2.625         ABD            27.5625
        165          27,225           20.625         CD           1701.5625
        215          46,225           26.875         ACD          2889.0625
       -115          13,225          -14.375         BCD           826.5625
        -21             441           -2.625         ABCD           27.5625

        960         752,176

     ** Note that this value is twice the grand average.

     960 = 2^4 (60)

     752,176 = 2^4 (47,011)
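The arithmetic of this example can be checked with a short Python sketch. Two things here are assumptions rather than statements of the report: the sixteen observations listed in the code, which are the values that reproduce the totals quoted in the example (sum 747, check sums 960 and 752,176), and the sign convention of the differences (first element of each pair minus the second). The function names are illustrative.

```python
def yates_2n(observations):
    """Return (n-th column of sums and differences, n) for 2^n observations."""
    column = list(observations)
    size = len(column)
    n = size.bit_length() - 1
    if size < 2 or size != 2 ** n:
        raise ValueError("the number of observations must be a power of 2")
    for _ in range(n):
        pairs = [(column[i], column[i + 1]) for i in range(0, size, 2)]
        # Assumed sign convention: first element of the pair minus the second.
        column = [a + b for a, b in pairs] + [a - b for a, b in pairs]
    return column, n


def effect_labels(n, letters="ABCDEFGH"):
    """Labels in the order in which the single degrees of freedom come out."""
    labels = ["Mean"]
    for letter in letters[:n]:
        labels += [letter if lab == "Mean" else lab + letter for lab in labels]
    return labels


# Assumed observations, in the order of presentation (factor A changing fastest).
observations = [60, 20, 83, 59, 19, 77, 13, 39, 5, 26, 27, 85, 25, 47, 86, 76]
final_column, n = yates_2n(observations)

for label, d in zip(effect_labels(n), final_column):
    # Columns: square, entry / 2^(n-1), effect label, square / 2^n.
    print(f"{d * d:>8} {d / 2 ** (n - 1):>9.3f}  {label:<5} {d * d / 2 ** n:>11.4f}")

# The two computational checks of this section.
assert sum(final_column) == 2 ** n * observations[0]
assert sum(d * d for d in final_column) == 2 ** n * sum(x * x for x in observations)
```

Because the squares do not depend on the sign of an entry, the column of single degrees of freedom is the same under either sign convention; only the column sum check distinguishes them.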




2. Analysis of variance of the 3^n factorial designs.


Let the observations be presented according to the scheme shown
here for n = 4.


     Designation of       Level of factor
     observation           A    B    C    D

        X0000              0    0    0    0
        X1000              1    0    0    0
        X2000              2    0    0    0
        X0100              0    1    0    0
        X1100              1    1    0    0
           .               .    .    .    .
        X2222              2    2    2    2
The 3^n observations give rise to 3^(n-1) successive sets of 3
values. A column of "sums and differences" having 3^n elements is
formed as follows: (1) the sums of the 3 elements of the 3^(n-1) sets
are tabulated in order; these are followed by (2) the 3^(n-1) differences
between the first and third elements of each set; and finally (3) the
value of the first element minus twice the middle element plus the
third element for each of the 3^(n-1) sets.



This same procedure is repeated on the successive columns of
"sums and differences" until the n-th column of "sums and differences"
is obtained. The square of the elements, D, of this n-th column
divided by a corresponding factor, d, gives a single degree of freedom
for the main effects or interactions in the analysis of variance
table. These single degrees of freedom come out in the following
order. (The subscript 1 refers to the linear component and the
subscript 2 refers to the quadratic component.)

CF, A1, A2, B1, A1B1, A2B1, B2, A1B2, A2B2, C1, A1C1, A2C1, etc.
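A minimal Python sketch of the repeated pass for the 3^n series follows. The two difference functions used here (first element minus third for the linear component, first minus twice the middle plus third for the quadratic component) are assumptions; they are consistent with the divisors 3, 2, 6 and with the check sum rule given in this section. The function names and the nine observations are made up.

```python
def pass_3n(column):
    """One pass over successive sets of 3: sums, linear values, quadratic values."""
    if len(column) % 3 != 0:
        raise ValueError("the column length must be a multiple of 3")
    sets = [column[i:i + 3] for i in range(0, len(column), 3)]
    sums = [x1 + x2 + x3 for x1, x2, x3 in sets]
    linear = [x1 - x3 for x1, x2, x3 in sets]              # assumed sign
    quadratic = [x1 - 2 * x2 + x3 for x1, x2, x3 in sets]  # assumed sign
    return sums + linear + quadratic


def yates_3n(observations, n):
    """Apply the pass n times to a column of 3^n observations."""
    if len(observations) != 3 ** n:
        raise ValueError("expected 3^n observations")
    column = list(observations)
    for _ in range(n):
        column = pass_3n(column)
    return column


# Made-up observations for n = 2, factor A changing fastest.
observations = [1, 2, 3, 4, 5, 6, 7, 8, 10]
final_column = yates_3n(observations, 2)
labels = ["CF", "A1", "A2", "B1", "A1B1", "A2B1", "B2", "A1B2", "A2B2"]
for label, value in zip(labels, final_column):
    print(f"{label:<5} {value:>4}")
```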


The appropriate divisors for the squares of the entries in the
n-th column are given by raising the triple (3, 2, 6) to the n-th
power according to the following rule:


for n = 2     (3, 2, 6)^2 = (3,2,6) (3,2,6)

                  3    2    6
                  3    2    6

                  9    6   18
                  6    4   12
                 18   12   36


The sequence of divisors for the corresponding elements of the
n-th column being


     9, 6, 18, 6, 4, 12, 18, 12, 36

for n = 3     (3, 2, 6)^3 = (3,2,6)^2 (3,2,6)


      9    6   18    6    4   12   18   12   36
      3    2    6

     27   18   54   18   12   36   54   36  108
     18   12   36   12    8   24   36   24   72
     54   36  108   36   24   72  108   72  216


The sequence of divisors being

     27, 18, 54, 18, 12, ........., 72, 108, 72, 216.

The extension to larger values of n is carried on in the same manner.
The case n = 4 is given in the worked out example.
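The rule for raising the triple to the n-th power can be sketched in Python as below. The function name is illustrative; the only input taken from the text is the triple (3, 2, 6) and the order in which the products are written down.

```python
def divisor_power(n, triple=(3, 2, 6)):
    """Divisors for the n-th column, obtained by raising the triple to the n-th power."""
    divisors = [1]
    for _ in range(n):
        # Multiply the whole current sequence by 3, then by 2, then by 6,
        # writing the three resulting rows one after another.
        divisors = [d * t for t in triple for d in divisors]
    return divisors


print(divisor_power(2))  # 9 divisors for n = 2
print(divisor_power(3))  # 27 divisors for n = 3
```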


Computational checks
    (1) The sum of squares of the original observations is
        equal to the sum Σ D^2/d.

    (2) The sum of the n-th column of "sums and differences"
        can be checked using the following procedure: from
        the successive sets of three values of the observation
        column form a column of the 3^(n-1) quantities obtained
        by taking 3 times the first element of the
        set minus the middle element plus the third element.
        Repeat this process on the column so formed.

        After n repetitions one final number remains. This
        number is the check sum for the n-th column of sums
        and differences.
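Both checks can be tried on a small case with the Python sketch below. It assumes the same linear and quadratic functions as the rest of this section is consistent with (first minus third, and first minus twice the middle plus third); the helper names and the nine observations are made up. Exact fractions are used so that check (1) can be tested for equality.

```python
from fractions import Fraction


def final_column_3n(observations, n):
    """n-th column of sums and differences for 3^n observations."""
    column = list(observations)
    for _ in range(n):
        sets = [column[i:i + 3] for i in range(0, len(column), 3)]
        column = ([a + b + c for a, b, c in sets]
                  + [a - c for a, b, c in sets]           # assumed linear function
                  + [a - 2 * b + c for a, b, c in sets])  # assumed quadratic function
    return column


def divisors_3n(n):
    """The triple (3, 2, 6) raised to the n-th power."""
    divisors = [1]
    for _ in range(n):
        divisors = [d * t for t in (3, 2, 6) for d in divisors]
    return divisors


def check_sum(observations):
    """Check (2): reduce each set of three to 3*first - middle + third, repeatedly."""
    column = list(observations)
    while len(column) > 1:
        column = [3 * column[i] - column[i + 1] + column[i + 2]
                  for i in range(0, len(column), 3)]
    return column[0]


n = 2
observations = [1, 2, 3, 4, 5, 6, 7, 8, 10]  # made-up values
final = final_column_3n(observations, n)
single_df = [Fraction(D * D, d) for D, d in zip(final, divisors_3n(n))]

print(sum(single_df) == sum(x * x for x in observations))  # check (1)
print(check_sum(observations) == sum(final))               # check (2)
```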


Combining the individual degrees of freedom 


The analysis of variance table is usually written in the form
shown here for n = 4.

                     Analysis of variance table

     Factor    Sum of Squares is Sum of      Degrees of    Mean square
                                              Freedom

     CF        CF                                 1
     A         A1 + A2                            2
     B         B1 + B2                            2
     AB        A1B1 + A2B1 + A1B2 + A2B2          4
     C         C1 + C2                            2
     AC            .                              4        sum of squares
     BC            .                              4        --------------
     ABC           .                              8             d.f.
     D             .                              2
     AD           etc.                            4
     BD                                           4
     ABD                                          8
     CD                                           4
     ACD                                          8
     BCD                                          8
     ABCD                                        16


In order to convert the column of values D^2/d (corresponding to
the 81 individual degrees of freedom for n = 4) into this conventional
form for the analysis of variance table, one can use successive
triads of the column of 81 individual d.f. Two columns of values are
formed from the 27 triads: (1) the first element of each triad is
recorded in sequence in the first column, and (2) the sum of the last
two elements of the 27 triads is recorded in the second column.

These two columns are then combined into a single column of 54
elements by writing the second column at the end of the first. This
process is repeated n = 4 times and the resulting column is the "sum
of squares" column in the standard analysis of variance table.

The degrees of freedom associated with the 16 sums of squares
for n = 4 may be obtained by raising the couple (1,2) to the 4-th
power as follows:

     (1,2)^2 = (1,2) (1,2) = (1,2,2,4)

     (1,2)^3 = (1,2)^2 (1,2) = (1,2,2,4,2,4,4,8)

     (1,2)^4 = (1,2)^3 (1,2) = (1,2,2,4,2,4,4,8,2,4,4,8,4,8,8,16)

etc.

The mean square is obtained by dividing the "sum of squares"
by this divisor.
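The triad procedure and the powers of the couple (1,2) can be sketched in Python as follows. The function names are illustrative, and the nine input values are made up: powers of 2 are used so that the terms entering each sum of squares can be read off from the result.

```python
def combine_triads(column):
    """First element of each triad, followed by the sums of the last two elements."""
    triads = [column[i:i + 3] for i in range(0, len(column), 3)]
    return [t[0] for t in triads] + [t[1] + t[2] for t in triads]


def sums_of_squares_3n(single_df, n):
    """Repeat the triad step n times on the column of 3^n individual d.f."""
    column = list(single_df)
    for _ in range(n):
        column = combine_triads(column)
    return column


def degrees_of_freedom_3n(n):
    """The couple (1, 2) raised to the n-th power."""
    df = [1]
    for _ in range(n):
        df = [d * c for c in (1, 2) for d in df]
    return df


# n = 2, order CF, A1, A2, B1, A1B1, A2B1, B2, A1B2, A2B2 (made-up values).
single_df = [1, 2, 4, 8, 16, 32, 64, 128, 256]
sums_of_squares = sums_of_squares_3n(single_df, 2)
print(sums_of_squares)           # CF, A, B, AB
print(degrees_of_freedom_3n(2))  # matching degrees of freedom
mean_squares = [ss / df for ss, df in zip(sums_of_squares, degrees_of_freedom_3n(2))]
```

In this illustration the second entry is 2 + 4 (A1 + A2), the third is 8 + 64 (B1 + B2), and the fourth is 16 + 32 + 128 + 256 (the four AB components).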
3. Analysis of variance of general factorial design


The proper sequence for presenting the column of observations
can probably best be described by giving an example. For a 4 x 3 x 2
factorial the observations are presented in the order corresponding
to the following combination of the factors

          A    B    C

          0    0    0
          1    0    0
          2    0    0
          3    0    0
          0    1    0
          .    .    .
          3    2    1

In general for a k x m x r factorial (k ≥ m, r) the observations
are put in order so that the k levels of the factor A are recorded
in their sequence mr times. The corresponding index for B is
obtained by writing each index for B k times and repeating this sequence
r times. The C index is obtained by writing each successive index
for C km times. For factorials with other than three factors, the
same approach is applied.
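The order of presentation can be generated mechanically. The Python sketch below is one way to do it; the function name is illustrative, and numbering the levels from 0 is an assumption carried over from the 2^n case.

```python
from itertools import product


def presentation_order(levels):
    """Combinations (a, b, c, ...) with the first factor changing fastest.

    levels gives the number of levels of each factor, e.g. (4, 3, 2).
    """
    # itertools.product varies its last argument fastest, so the factors
    # are reversed on the way in and each combination reversed on the way out.
    combos = product(*(range(size) for size in reversed(levels)))
    return [tuple(reversed(combo)) for combo in combos]


order = presentation_order((4, 3, 2))
for a, b, c in order:
    print(a, b, c)
```

For the 4 x 3 x 2 factorial this gives the A index as 0, 1, 2, 3 repeated mr = 6 times, each B index written k = 4 times with that sequence repeated r = 2 times, and each C index written km = 12 times.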
The procedure then consists of forming a 2nd, 3rd, 4th, ... etc.
column of "sums and differences" by applying the appropriate set of
linear functions for the number of levels of each successive factor
to the preceding column. The final column of "sums and differences"
so obtained is squared and divided by an appropriate constant to
give the individual degrees of freedom of the analysis of variance.

These individual degrees of freedom come out in the following
order:

for the 4 x 3 x 2 factorial we have

                    divisor
     CF                24
     A1               120
     A2                24
     A3               120
     B1                16
     A1B1              80
     A2B1              16
     A3B1              80
     B2                48
     A1B2             240
     A2B2              48
     A3B2             240
     C                 24
     A1C              120
     A2C               24
     A3C              120
     B1C               16
     A1B1C             80
     A2B1C             16
     A3B1C             80
     B2C               48
     A1B2C            240
     A2B2C             48
     A3B2C            240


The column of divisors are those used to divide the corresponding
squares of the elements in the 3rd (last) column of "sums and
differences".


The appropriate divisors are obtained by writing the coefficients
d_ij (given in the table on page 13) in the form

     (d_11  d_12  ...  d_1k)     for the k levels of A

and multiplying by the successive terms of

     (d_21  d_22  ...  d_2m)     for the m levels of B

to get

     d_11 d_21, d_12 d_21, ..., d_1k d_21, d_11 d_22, ..., d_1k d_2m

then carrying on the process until all factors are accounted for.
For the 4 x 3 x 2 factorial we have:

     (4 20 4 20) (3,2,6) (2,2)

Doing the multiplication we get

     (12 60 12 60  8 40  8 40  24 120 24 120) (2,2) =

     (24 120 24 120  16 80 16 80  48 240 48 240
      24 120 24 120  16 80 16 80  48 240 48 240)

as shown on page 15.


Computational check


The sum of squares of the observations is equal to the sum of
the individual degrees of freedom, i.e. Σ X^2 = Σ D^2/d.
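The whole procedure for a general factorial, including this check, can be sketched in Python as below. The table of linear functions in the code is an assumption: it uses the usual orthogonal polynomial coefficients for 2, 3 and 4 levels, with signs chosen to match the first-minus-last convention, because the report's own table is not reproduced here. Their sums of squares do give the divisors (2,2), (3,2,6) and (4 20 4 20) quoted in the text. The function name and the 24 observations are made up.

```python
from fractions import Fraction
from math import prod

# Assumed linear functions for each number of levels (signs are an assumption).
COEFFICIENTS = {
    2: [(1, 1), (1, -1)],
    3: [(1, 1, 1), (1, 0, -1), (1, -2, 1)],
    4: [(1, 1, 1, 1), (3, 1, -1, -3), (1, -1, -1, 1), (1, -3, 3, -1)],
}


def general_yates(observations, levels):
    """Return (final column, divisors) for a factorial with the given levels."""
    if len(observations) != prod(levels):
        raise ValueError("number of observations must equal the product of the levels")
    column = list(observations)
    divisors = [1]
    for size in levels:
        rows = COEFFICIENTS[size]
        sets = [column[i:i + size] for i in range(0, len(column), size)]
        # Each linear function is applied to every set in turn, one function after another.
        column = [sum(c * x for c, x in zip(row, s)) for row in rows for s in sets]
        # The divisor of a linear function is the sum of squares of its coefficients.
        divisors = [d * sum(c * c for c in row) for row in rows for d in divisors]
    return column, divisors


# Made-up observations for a 4 x 3 x 2 factorial, factor A changing fastest.
observations = [12, 7, 3, 18, 5, 9, 14, 2, 11, 6, 16, 4,
                8, 13, 1, 10, 15, 17, 20, 19, 21, 24, 22, 23]
final, divisors = general_yates(observations, (4, 3, 2))
single_df = [Fraction(D * D, d) for D, d in zip(final, divisors)]

print(divisors)
print(sum(single_df) == sum(x * x for x in observations))  # the computational check
```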


Combination of individual degrees of freedom


A combination of individual degrees of freedom is accomplished
by forming two columns using sets of k (the number of levels of factor
A) individual degrees of freedom in the D^2/d column. Two columns are
formed: the first by writing the first element of each set of k in
sequence, the second by adding the next (k-1) elements of each set.
The two columns are formed into one by appending the second column
to the end of the first. This column is then operated on using sets
of m elements (the number of levels of factor B). Two columns are
again formed as before and combined. This process is carried out
for all factors.

The number of degrees of freedom associated with the final
column of "sums of squares" are given by the terms of the product


     (1,k-1) (1,m-1) (1,r-1)


For the 4 x 3 x 2 example we have (from page 15), using sets of 4,


     CF        A
     B1        AB1
     B2        AB2
     C         AC
     B1C       AB1C
     B2C       AB2C


Using sets of 3 from this column we get


     CF        B
     C         BC
     A         AB
     AC        ABC


And finally, using sets of 2


     CF        C
     A         AC
     B         BC
     AB        ABC

Thus we have

               d.f.
     CF         1
     A          3
     B          2
     AB         6
     C          1
     AC         3
     BC         2
     ABC        6


The sequence of d.f. is obtained by multiplying

     (1,3) (1,2) (1,1) = (1 3 2 6) (1,1)
                       = (1,3,2,6,1,3,2,6)
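The combination step can be followed with labels instead of numbers. In the Python sketch below each individual degree of freedom is a one-element tuple holding its label, and "adding" two entries joins their tuples, so the final column shows which individual terms enter each sum of squares. This representation and the function names are illustrative choices, not part of the original procedure.

```python
from functools import reduce
from operator import add


def combine(column, levels):
    """Combine individual d.f. using sets of k, then m, then r, ... elements."""
    column = list(column)
    for size in levels:
        sets = [column[i:i + size] for i in range(0, len(column), size)]
        firsts = [s[0] for s in sets]
        rest = [reduce(add, s[1:]) for s in sets]  # sum of the next (size - 1) elements
        column = firsts + rest
    return column


def df_sequence(levels):
    """Terms of the product (1, k-1) (1, m-1) (1, r-1) ..."""
    df = [1]
    for size in levels:
        df = [d * c for c in (1, size - 1) for d in df]
    return df


# Labels of the 24 individual d.f. of the 4 x 3 x 2 example, in the order they come out.
a_terms, b_terms, c_terms = ["", "A1", "A2", "A3"], ["", "B1", "B2"], ["", "C"]
labels = [((a + b + c) or "CF",) for c in c_terms for b in b_terms for a in a_terms]

combined = combine(labels, (4, 3, 2))
for terms, df in zip(combined, df_sequence((4, 3, 2))):
    print(df, " + ".join(terms))
```

The same `combine` function works unchanged on a numeric column of D^2/d values, since only addition is used.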


